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ABSTRACT 

In  this  paper,  we  propose  an  efficient  multiscale  approach  for  the  segmentation  of  natural  clutter,  specifically  grass 
and  forest,  in  synthetic  aperture  radar  (SAR)  imagery.  This  method  exploits  the  coherent  nature  of  SAR  sensors.  In 
particular,  we  exploit  the  characteristic  statistical  differences  in  imagery  of  different  clutter  types,  as  a  function  of 
scale,  due  to  radar  speckle.  We  employ  a  recently  introduced  class  of  multiscale  stochastic  processes  that  provide  a 
powerful  framework  for  describing  random  processes  and  fields  that  evolve  in  scale.  We  build  models  representative 
of  each  category  of  clutter  of  interest  (i.e.  grass  and  forest),  and  use  these  models  to  segment  the  imagery  into  these 
two  clutter  classes.  The  scale-autoregressive  nature  of  the  models  allows  extremely  efficient  calculation  of  the  relative 
likelihoods  of  different  clutter  classifications  for  windows  of  SAR  imagery,  and  we  use  these  likelihoods  as  the  basis 
for  classifying  image  pixels  and  for  accurately  estimating  forest-grass  boundaries.  We  evaluate  the  performance  of  the 
technique  by  testing  it  on  0.3  meter  SAR  data  gathered  with  the  Lincoln  Laboratory  Millimeter- Wave  SAR. 

Keywords:  Multiscale,  SAR,  Speckle,  Segmentation 

1  INTRODUCTION 

In  recent  years,  there  has  been  a  growing  interest  in  synthetic  aperture  radar  (SAR)  imaging  for  applications 
ranging  from  remote  sensing  to  surface  surveillance  and  automatic  target  recognition  (ATR).  For  applications  such  as 
these,  the  classification  of  various  categories  of  clutter  is  quite  important,  and  their  delineation  (i.e.  segmentation) 
can  play  a  key  role  in  the  subsequent  analysis  for  target  detection,  recognition,  and  image  compression.  In  light  of 
typical  coverage  rates  (exceeding  1  km?  js)  of  an  air-borne  SAR,  it  is  of  great  importance  to  devise  efficient  (preferably 
parallelizable)  algorithms  capable  of  meeting  the  daunting  computational  demands  of  the  resulting  data  collection. 

*This  work  was  sponsored  in  part  by  the  Advanced  Research  Projects  Agency  under  Air  Force  Contract  F19628-95-C-0002  and  Air  Force 
OfBce  of  Scientific  Research  Grant  F49620-93-1-0604  and  in  part  by  Air  Force  Office  of  Scientific  Research  under  Grant  F49620-95- 1-0083. 
Opinions,  interpretations,  conclusions,  and  recommendations  are  those  of  the  authors  and  are  not  necessarily  endorsed  by  the  United  States 
Air  Force. 
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In  this  paper,  we  apply  a  multiscale  approach  to  the  SAR  image  segmentation  problem  that  exploits  the  coherent 
nature  of  SAR  image  formation.  In  particular,  we  build  on  the  idea  of  characterizing  and  exploiting  the  scale-to-scale 
statistical  variations  in  SAR  imagery  due  to  radar  speckle.®^®  A  SAR  image  measures  the  coherent  sum  of  the  returns 
from  all  radar  scatterers  within  each  resolution  cell  (pixel).  As  the  resolution  of  the  imagery  changes,  the  set  of 
complex  scatterers  contributing  to  each  resolution  cell  changes;  similarly,  the  relative  contribution  of  each  scatterer  to 
the  resolution  cell  changes.  Because  this  is  a  coherent  process,  there  can  be  constructive  and  destructive  interference 
among  the  scatterers.  Consequently,  there  is  a  statistical  variation  in  the  imagery  from  resolution  to  resolution  (i.e. 
from  scale  to  scale).  The  statistics  of  the  variation  depend  on  the  distribution  of  the  scattering  elements  in  the 
environment.  More  specifically,  the  statistics  depend  on  the  type  of  the  clutter  being  imaged. 

To  exploit  this  phenomenon,  we  employ  a  recently  introduced  class  of  multiscale  stochastic  models. These  models 
provide  a  powerful  framework  for  describing  random  fields  that  evolve  in  scale.  The  framework  uses  a  pyramidal  tree 
structure  in  which  each  tree  node  corresponds  to  a  pixel  at  a  particular  image  location  and  resolution.  The  offspring 
of  the  node  correspond  to  the  pixels,  in  the  same  location,  at  the  next  finer  scale.  The  statistical  variability  of  the 
multiscale  imagery  is  captured  by  a  scale-recursive  stochastic  model  for  each  clutter  type. 

In  this  paper,  we  hypothesize  that  the  scale-to-scale  variation  of  distinct  clutter  types  will  differ  in  a  statistically 
significant  manner.  If  this  is  the  case,  very  efficient  algorithms  associated  with  the  multiscale  models  can  be  applied 
to  calculate  likelihoods  for  the  classification  of  individual  SAR  image  pixels  and  the  subsequent  segmentation  of  SAR 
imagery.  We  demonstrate  the  utility  of  the  multiscale  methodology  for  the  segmentation  of  regions  of  trees  and  forest 
from  open  fields  and  grass.  Such  segmentation  can  be  useful  for  ATR  systems.  For  example,  if  a  densely  forested  region 
is  identified,  performing  target  detection  in  such  a  region  is  unnecessary  because  high-frequency  SAR  is  incapable  of 
providing  imagery  of  targets  under  the  forest  canopy.  Moreover,  targets  of  interest  often  attempt  to  conceal  themselves 
near  tree  lines.  Thus,  accurate  estimation  of  tree  lines  can  be  used  to  focus  attention  of  ATR  algorithms  on  areas  of 
particular  interest. 

In  the  next  section,  we  describe  the  multiscale  framework  and  its  application  to  SAR  image  segmentation.  In 
Section  3,  we  describe  the  segmentation  of  SAR  imagery  into  different  clutter  classes.  In  Section  4,  we  evaluate  the 
performance  of  this  approach  when  applied  to  0.3  meter  resolution  SAR  imagery  collected  by  the  Lincoln  Laboratory 
Millimeter- Wave  SAR.® 


2  MULTISCALE  MODELS  OF  SAR  IMAGERY 


This  section  describes  a  multiscale  framework  for  analyzing  SAR  imagery.  Subsection  2.1  describes  the  mapping 
of  multiscale  imagery  onto  a  quadtree  structure.  Subsection  2.2  describes  a  class  of  stochastic  models  for  describing 
and  analyzing  multiscale  processes  that  are  mapped  onto  quadtrees.  Subsection  2.3  specifies  an  autoregressive  model 
for  SAR  imagery.  And,  Subsection  2.4  describes  the  estimation  of  parameters  within  the  multiscale  SAR  models. 


2.1  Quadtree  Interpretation  of  SAR  Imagery 

The  starting  point  for  model  development  is  a  multiscale  sequence  ■  ■  ■  ,Io  of  SAR  images,  where  II  and  Jo 

correspond  to  the  coarsest  and  finest  resolution  images,  respectively.  The  resolution  varies  dyadically  between  images 
at  successive  scales.  More  precisely,  we  assume  that  the  finest-scale  image  Jq  has  a  resolution  of  S  x6  and  consists  of  an 
N  X  N  array  of  pixels  (with  N  =  2^  for  some  M).  Hence,  each  coarser  resolution  image  Im  has  2“'’”A/'  x  2“™W  pixels 
and  resolution  2™^  x  2™i5.  Each  pixel,  is  obtained  by  taking  the  coherent  sum  of  complex  fine-scale  imagery 

over  2*"  x  2™  blocks,  performing  log-detection  (computing  20  times  the  log-magnitude),  and  correcting  for  DC  gain 
variations  by  subtracting  the  mean  value.®  (Note  that  the  imagery  is  converted  to  log-magnitude  because  multiscale 
recursive  models,  described  below,  have  empirically  proved  most  effective  when  using  this  representation.  Use  of  the 


Figure  1:  Sequence  of  three  multiscale  SAR  images  mapped  onto  a  quadtree.  The  pixel  value  at  scale  m  and  position 
(fc,  1)  is  denoted  by  Im.{k,l). 


complex  imagery  directly  is  impractical  because  of  the  variability  of  the  phase  in  the  imagery.  Also,  the  log-magnitude 
of  the  imagery  provides  more  well-behaved  residuals,  in  a  statistical  sense,  than  the  magnitude  imagery.)  Each  pixel 
in  image  /„  corresponds  to  four  “child”  pixels  in  image,  Im-i-  This  indicates  that  a  4*^  order  tree,  or  quadtree,  is 
natural  for  the  mapping.  Furthermore,  each  node  s  on  the  quadtree  can  be  thought  of  as  having  associated  with 
it  a  3-tuple  where  m  denotes  scale  and  {k,l)  denotes  two-dimensional  image  pixel  location.  That  is,  each 

node  s  on  the  tree  is  associated  with  one  of  the  pixels  Im{k,l)  corresponding  to  pixel  {k,l)  of  SAR  image  Im-  As  an 
example.  Figure  1  illustrates  a  multiscale  sequence  of  three  SAR  images,  together  with  the  quadtree  mapping.  Here 
the  finest-scale  SAR  imagery  is  mapped  to  the  finest  level  of  the  tree,  and  each  coarse  scale  representation  is  mapped 
to  successively  higher  levels.  Furthermore,  we  use  the  notation  I{s)  to  indicate  the  pixel  mapped  to  node  s. 


2.2  Multiscale  Stochastic  Models 

In  this  subsection,  we  describe  a  general  multiscale  modeling  framework^’^  and  its  applicability  to  the  SAR  quadtree 
representation.  Under  this  framework,  a  multiscale  process  is  mapped  onto  nodes  of  a  qth  order  tree,  where  q  is 
dependent  upon  the  manner  in  which  the  process  progresses  in  scale.  A  qth  order  tree  is  one  in  which  each  node, 
starting  at  some  root  node,  branches  off  to  q  child  nodes.  (As  described  above,  the  appropriate  representation  for 
a  multiscale  SAR  image  sequence  is  g  =  4,  a  quadtree.)  Each  level  of  the  tree  can  be  viewed  as  a  distinct  scale 
representation  of  a  random  process,  with  the  resolutions  proceeding  from  coarse  to  fine  as  the  tree  is  traversed  from 
top  to  bottom  (root  node  to  terminal  nodes).  Letting  s  denote  any  node  on  the  tree  and  sj  its  parent,  the  state 
elements  at  these  nodes  are  related  by  the  coarse-to-fine  recursion 

x(s)  =  A(s)x(s7)  -f-  B(s)w(s).  (1) 

In  this  recursion,  A(s)  and  B(s)  are  matrices  of  appropriate  size  and  the  term  ■w(s)  represents  white  driving  noise. 
The  matrix  A(s)  captures  the  deterministic  progression  from  node  57  to  node  s,  i.e.,  the  part  of  x(s)  predictable 
from  x(s7),  while  the  term  B(s)w(s)  represents  the  unpredictable  component  added  in  the  progression.  An  attractive 
feature  of  this  framework  is  the  efficiency  it  provides  for  signal  processing  algorithms.  This  stems  from  the  Markovian 
property  of  the  multiscale  model  class,  which  states  that,  conditioned  on  the  value  of  the  state  at  any  node  s,  the 
processes  defined  on  each  of  the  distinct  subtrees  extending  away  from  node  s  are  mutually  independent. 

For  the  application  of  segmenting  different  types  of  clutter  in  SAR  imagery,  a  multiscale  model  can  be  constructed 
for  each  clutter  class.  To  specify  each  model,  it  is  necessary  to  determine  the  appropriate  coefficients  in  the  matrix, 
A(s),  and  the  statistical  properties  of  the  driving  noise,  w(s).  Once  the  models  have  been  specified,  a  likelihood  ratio 
test  can  be  derived  to  segment  the  imagery  into  the  clutter  classes. 


Consider  the  problem  of  segmenting  forested  regions  from  grass  regions  in  SAR  imagery.  For  each  pixel  in  the 
image,  we  choose  between  two  hypotheses:  the  pixel  is  part  of  a  grass  region  (Hg)  or  a  forest  region  (Hf).  The  log 
likelihood  ratio  test  for  classifying  each  pixel  based  on  multiscale  imagery  is  given  by 

£  =  log  •  •  Jo  I -ff,,)] -log  .  ,/o  I  i?/)]  .  (2) 

By  invoking  the  state  space  interpretation  of  multiscale  imagery  and  exploiting  the  Markovian  property  of  the  multiscale 
models,  the  log  likelihood  ratio  test  for  the  two  competing  hypotheses  can  also  be  written, 

^  [Px(s)|x(sT),Hg(X('S)  I  -  ^log  [px(s)|X(s7),Hf(X(s)  |  X(sj),Hf)]  .  (3) 

s  s 

Here,  Px(6)|x(s7),He.  and  Px(s)|x(s7),h,  are  the  conditional  distributions  for  x(s)  given  x(s7)  for  the  two  hypothesized 
models.  In  the  next  section,  we  will  show  that  this  likelihood  test  can  be  computed  efficiently  in  terms  of  the 
distributions  for  vf{s)  under  the  two  hypotheses. 


2.3  Scale-Autoregressive  SAR  Model 


In  this  paper,  we  focus  on  a  specific  class  of  multiscale  models,  namely  scale-autoregressive  models®’^  of  the  form 

I(s)  =  ai(s)/(s7)  +  a2(s)/(s7^)  +  . . .  +  a7>(s)I(s7^)  +  w(s) ,  ai{s)  £  M  (4) 

where  w{s)  is  white  driving  noise.  For  homogeneous  regions  of  texture,  the  prediction  coefficients  (the  ai{s)  in  (4))  are 
constant  with  respect  to  image  location  for  any  given  scale.  That  is,  the  coefficients,  ai(s), . . .  ,ap{s),  depend  only  on 
the  scale  of  node  s  (denoted  by  m(s)),  and  thus  will  be  denoted  by  ai,m(s)!  •  •  •  >  ap,m(s)-  Furthermore,  the  probability 
distribution  for  w{s)  depends  only  on  m{s).  Thus,  specifying  both  the  scale- regression  coefficients  and  the  probability 
distribution  for  w{s)  at  each  scale  completely  specifies  the  model. 


Following  the  procedure  of  state  augmentation  used  in  converting  autoregressive  time  series  models  to  state  space 
models,  we  associate  to  each  node  s  a  P-dimensional  vector  of  pixel  values,  where  P  is  the  order  of  the  regression 
in  (4).  The  components  of  this  vector  correspond  to  the  SAR  image  pixel  associated  with  node  s  and  its  first  P  —  1 
ancestors.  Specifically,  we  define 


x(s)  =  [  I{s)  /(S7)  •••  J(s7^  1)  ]^. 

The  recursion  in  (1)  takes  on  the  form 


,m(s) 

^2,m(s) 

l,7n(5)  ^P,Tn(s) 

■  1  ■ 

1 

0  0 

0 

0 

x(s)  = 

0 

1  0 

0 

x(s7)  -(- 

0 

0 

0 

1  0 

_  0 

(5) 


(6) 


Thus,  for  a  model  of  the  form  (4)  or  equivalently  (6),  £  in  (3)  can  be  calculated  using 

PX(s)|X(«7)(^(5)  I  ^(■S'y))  =  Pw{s){W{s)), 

where 

W{s)  =  I{s)  -  +  ■  ■  ■+ap^rn{s)I{sj^)]  ^ 

and  Pw(s)  is  the  probability  distribution  for  w{s).  By  substituting  (7)  into  (3),  the  test  statistic  can  be  written 


^  =  X^log  [Pwis)\H .(wi-s)  I  Hg)]  -  ^log  [Pv,{s)\Hfiw{s)  1  Hf)]  . 

5  S 


Thus,  the  likelihood  test  can  be  computed  from  the  residuals,  w{s). 


(7) 

(8) 
(9) 


2.4  Identification  of  Multiscale  Models  of  SAR  Imagery 


In  order  to  use  the  multiscale  methodology  just  described,  we  need  to  identify  the  model  parameters  for  each 
clutter  class  of  interest,  namely  the  model  order  P,  the  model  coefficients  ■  ■  ■  ,o,p,m{s)  for  each  scale,  and 

the  probability  distribution  for  w(s)  at  each  scale.  To  accomplish  this,  we  choose  a  homogeneous  training  region  of 
SAR  imagery  representative  of  each  clutter  class  being  modeled.  This  region  is  subsequently  processed  to  produce  a 
sequence  of  images  ■  ■  ■  ,Io-  The  regression  coefficients  for  each  scale  k  are  obtained  by  a  standard  least-squares 

minimization, 

a*  =  arg  min  <  V  [l{s)  -  ai^kHs^)  ~  -  ap,kl{sj^)f  }  ,  (10) 

where 

=  [ai^k  a'2,k  ■■■  ap^k]^  ■ 

By  varying  the  regression  order  P,  one  can  achieve  a  tradeoff  between  computational  complexity  and  model  accuracy. 
We  have  found  that  P  =  3  is  adequate  for  our  purposes. 

To  obtain  a  statistical  characterization,  the  prediction  error  residuals  (the  w{s)  in  (4))  of  the  model  at  scale  k, 
we  evaluate  the  residuals  in  predicting  scale  k  of  the  test  homogeneous  region.  In  particular,  we  use  the  a*  found 
in  (10)  to  evaluate  all  {w(s)|m(s)  =  A;}  in  (8).  We  then  choose  a  theoretical  distribution  that  provides  a  good  fit  to 
the  normalized  histogram  of  these  residuals.  In  Subsection  4.1,  we  define  the  theoretical  distributions  chosen  for  the 
models  for  grass  and  forest  and  illustrate  their  accuracy  for  statistically  representing  the  prediction  error  residuals. 


3  NATURAL  CLUTTER  SEGMENTATION 


In  this  section,  we  describe  a  procedure  for  the  segmentation  of  SAR  imagery  consisting  primarily  of  natural  clutter. 
The  starting  point  for  this  development  is  the  construction  of  multiscale  SAR  models  for  different  clutter  types  as 
described  in  the  preceding  section.  Although  we  illustrate  the  approach  by  focusing  on  distinguishing  forested  regions 
from  grass,  the  methods  are  readily  extended  to  additional  clutter  types. 


3.1  Basic  Structure  of  the  Segmentation  Procedure 

We  employ  the  multiscale  models  constructed  for  forest  and  grass  in  the  preceding  section  to  classify  individual 
pixels  and  subsequently  segment  regions  of  clutter.  In  a  fully  optimal  method,  one  would  also  postulate  a  spatial 
random  field  model  for  each  clutter  category,  capturing,  for  example,  the  fact  that  the  classification  of  a  given  pixel 
is  very  likely  to  be  the  same  as  its  neighbors. The  use  of  such  a  model,  however,  can  increase  the  computational 
complexity  of  the  classification  algorithm  considerably.  Consequently,  we  take  a  simpler  approach  that  exploits  the 
pyramidal  structure  of  the  likelihood  calculation  for  the  multiscale  models.  Specifically,  we  classify  each  individual 
pixel  based  on  a  test  window  of  specified  size  surrounding  that  pixel. 

The  size  of  the  window  used  in  the  pixel-by-pixel  classification  must  be  chosen  carefully.  A  larger  window  provides 
a  more  accurate  classification  of  homogeneous  regions.  Using  larger  windows,  however,  increases  the  likelihood  that 
the  window  contains  a  clutter  boundary.  Thus,  keeping  the  window  size  as  small  as  possible  is  also  desirable.  As 
demonstrated  in  the  next  section,  by  examining  the  empirical  distribution  of  £  in  (2)  over  windows  of  various  size  for 
homogeneous  regions  of  grass  and  forest,  we  can  determine  the  tradeoff  between  classification  accuracy  and  window 
size.  This,  in  turn,  allows  us  to  choose  a  window  size  that  yields  adequate  performance  in  classifying  homogeneous 
regions  of  clutter. 


Whenever  a  clutter  boundary  is  present  within  a  test  window,  however,  the  validity  of  the  center  pixel  classification 
is  questionable.  This  effect,  if  not  taken  into  account,  can  result  in  a  classification  bias  near  boundaries.  To  address 
this  problem,  we  devise  a  method  to  detect  the  proximity  of  grass-forest  boundaries  as  well  as  a  procedure  to  refine 
the  subsequent  classification.  Proximity  to  boundaries  is  detected  via  a  simple  modification  of  the  decision  made 
based  on  the  test  statistic  1.  Specifically,  rather  than  comparing  £  to  a  single  threshold  to  decide  on  a  grass-or-forest 
classification,  we  compare  t  to  the  two  thresholds  a  and  h: 

i  >  a  Classify  as  Grass, 

a  >  I  >  b  Defer  decision  (possible  boundary  presence), 

(.  <  b  Classify  as  Forest. 

The  resulting  test  structure  is  illustrated  in  Figure  2,  in  which  the  box  designated  “defer”  corresponds  to  the  test 
statistic  falling  between  the  two  thresholds.  In  such  cases,  a  refinement  procedure,  described  below,  is  used  to  clas¬ 
sify  the  pixel.  As  we  describe  in  Section  4,  the  choice  of  thresholds  a  and  b  are  determined  by  examining  empirical 
distributions  of  i  for  windows  containing  boundaries  with  varying  fractions  of  forest  and  grass. 


Figure  2:  Sequence  of  steps  involved  in  initial  pixel  classification,  i)  Creation  of  multiscale  sequence  from  the  window 
region,  ii)  Evaluation  of  decision  statistic  £.  iii)  Thresholding  to  determine  center  pixel  classification. 


3.2  Deferred  Pixel  Classification 

For  pixels  where  the  classification  decision  has  been  deferred,  it  is  necessary  to  determine  the  appropriate  classifi¬ 
cation.  The  structure  of  the  multiscale  likelihood  calculations  allows  us  to  perform  this  additional  task  as  a  replication 
of  the  procedure  of  Figure  2  at  a  hierarchy  of  scales.  Recall  that  the  objective  of  this  process  is  to  classify  the  center 
pixel  of  a  window  as  either  forest  or  grass.  Consequently,  in  a  region  that  is  likely  to  contain  a  boundary,  it  is  necessary 
to  determine  on  which  side  of  the  boundary  the  center  pixel  lies.  Under  the  assumption  that  the  window  contains  only 
a  single,  smooth  boundary,  the  classification  can  be  made  by  determining  which  of  the  two  hypothesized  clutter  types 
occupies  the  majority  of  the  window.  Consequently,  each  deferred  pixel  is  reclassified  by  considering  separately  the 
four  quadrants  of  the  initial  window  region  surrounding  the  pixel.  The  classification  procedure  described  in  Subsec¬ 
tion  3.1  is  then  performed  on  each  sub-window  independently.  This  procedure  is  repeated  recursively  on  each  smaller 
sub-window  classified  as  “defer”  until  either  a  majority  rule  decision  of  the  entire  windowed  region  is  determined  or 
statistical  significance  is  lost  due  to  decreasing  sub-window  size. 


This  “progressive  refinement”  around  boundary  regions  is  depicted  in  Figure  3.  In  each  frame,  the  pixel  of  interest 
is  exaggerated  by  the  solid  square  box  in  the  center  of  the  image.  Figure  3(a)  represents  the  original  windowed 
region  about  the  center  pixel.  Based  on  the  test  illustrated  in  Figure  2  this  window  has  been  identified  as  potentially 
containing  a  grass-forest  boundary.  Note  that  the  correct  center  pixel  classification  for  this  region  is  grass,  but  due 
to  the  boundary  proximity,  the  classification  has  been  deferred.  Figures  3(b)  through  3(d)  display  the  successive 
subdivision  and  reclassification  (F,  G,  and  D  represent  classification  as  forest,  grass,  and  defer  respectively),  where  at 
each  stage  only  the  regions  deferred  at  the  preceding  stage  are  subject  to  further  examination.  Note  that  in  Figure  3(d), 
9/16  of  the  region  has  been  clcissified  as  grass,  hence  the  center  pixel  is  ultimately  classified  as  grass. 


Figure  3:  Boundary  pixel  refinement  of  typical  SAR  window  region:  (a)  Window  region  deferred  due  to  boundary 
presence,  b)  Region  divided  into  quadrants,  with  ternary  classification  results  marked  for  each  subregion,  (c)  &  (d) 
Regions  still  classified  as  boundary  further  subdivided  and  reclassified.  Classifications  G,  F,  and  D  refer  to  grass, 
forest,  and  defer  respectively.  Note  that  9/16  of  the  window  region  is  classified  as  grass;  thus,  the  pixel  is  classified  as 
grass. 


Due  to  the  pyramidal  structure  of  the  multiscale  models,  the  likelihood  calculation  can  also  be  carried  out  in  a 
pyramidal  fashion.  By  judiciously  organizing  the  calculations,  the  likelihood  ratios  for  all  of  the  sub-windows  within 
a  particular  window  can  be  calculated  with  the  same  total  computational  effort  needed  to  calculate  likelihood  for  the 
entire  window. 


4  EXPERIMENTAL  RESULTS 


We  have  applied  the  multiscale  segmentation  algorithm  to  0.3  meter  resolution  HH  polarization  SAR  imagery. 
This  imagery  was  gathered  near  Stockbridge,  New  York  with  the  Lincoln  Laboratory  Millimeter- Wave  SAR.^  In 
the  next  subsection,  we  describe  the  construction  of  the  models  on  which  the  subsequent  experiments  are  based. 
In  Subsection  4.2,  we  discuss  the  details  of  the  algorithm  design,  namely  the  choice  of  window  sizes  and  decision 
thresholds.  In  Subsection  4.3,  we  evaluate  the  performance  of  the  algorithm  for  clutter  segmentation. 


4,1  Model  Construction 

In  order  to  apply  these  methods,  it  is  necessary  to  construct  multiscale  models  for  SAR  imagery  of  grass  and  forest 
from  homogeneous  regions  of  clutter.  We  have  chosen  to  use  a  third-order  regression  for  each  model  and  to  build 
models  for  the  prediction  of  each  of  the  three  finest  resolution  images  {6  x  6,  2S  x  2S,  and  46  x  46,  with  6  =  0.3  m). 
As  described  in  Subsection  2.4,  this  implies  a  third-order  model  with  a  four-level  tree.  For  the  prediction  of  each  of 
the  three  finest  resolution  images,  three  coefficients  (ai,  02,  and  03)  must  be  specified.  Using  the  method  described  in 
Subsection  2.4,  we  determined  the  coefficient  values  given  in  Table  1.  Note  that  the  coefficients  for  the  forest  model 
are  consistently  larger,  indicating  higher  scale  to  scale  correlation.  This  inter-scale  correlation  is  consistent  with  our 


expectations  due  to  the  scatterer  distribution  in  the  image.  Grassy  regions  tend  to  have  a  large  number  of  equi-valued 
scatterers  and,  as  a  result,  a  great  number  of  scatterers  migrate  in  or  out  of  each  resolution  cell  as  resolution  is  varied. 
Hence,  one  would  expect  SAR  imagery  of  grassy  regions  to  display  less  scale  to  scale  correlation. 


Forest 

Resolution 

ai 

0-2 

0^3 

5x5 

0.5842 

0.1257 

0.0669 

25  X  25 

0.5005 

0.1222 

0.0683 

45  X  45 

0.4584 

0.1292 

0.0250 

Grass 

Resolution 

ai 

a2 

as 

Sx6 

0.5263 

0.0720 

-0.0029 

25  X  25 

0.3135 

0.0313 

-0.0064 

45  X  45 

0.2278 

0.0169 

-0.0006 

(a)  (b) 

Table  1;  Model  coefficients  for  third  order  regression  in  scale,  (a)  Grass  model  coefficients,  (b)  Forest  model  coefficients. 

To  complete  the  models,  it  is  necessary  to  also  specify  the  distributions  for  w{s)  in  (4)  for  each  scale  and  each 
clutter  type.  For  imagery  of  grass,  a  log-Rayleigh  distribution, 

Pwi3){W{s))  =  kexp[kW{s) -'T  -  exp{kW{s) -j)], 

In(lO) 

10  ’ 

7  w  0.57721566  (Euler’s  constant), 


provided  a  good  fit.  For  imagery  of  trees,  a  zero-mean  Gaussian  distribution  provided  a  good  fit;  in  particular,  the 
standard  deviations  chosen  for  the  Gaussian  density  of  w{s)  for  each  of  the  three  scales  was: 


dxd:  5.3724,  25  x  2d  :  6.1811,  45x45:6.6056 


Examples  showing  histograms  and  the  accuracy  of  the  resulting  fits  are  given  in  Figure  4. 


Figure  4:  Histograms  of  residuals  in  prediction  of  second-finest  resolution  for  (a)  Grass  model  (b)  Forest  model.  Solid 
line  represents  (a)  Log-Rayleigh  distribution  (b)  Gaussian  distribution. 


4.2  Algorithm  Specification 

It  is  necessary  to  specify  both  the  size  of  the  window  used  to  compute  the  likelihood  test  and  the  thresholds  used  at 
each  stage  in  the  hierarchical  procedure.  As  we  have  discussed,  the  choice  of  window  size  involves  a  tradeoff  between 
the  statistical  significance  of  the  test  and  the  possibility  of  clutter  boundaries  intersecting  the  window.  On  row  (i)  of 
Figure  5  we  have  displayed  histograms  of  the  value  of  the  test  statistic  £  computed  over  homogeneous  regions  of  both 
clutter  types  using  windows  of  different  sizes  (namely  128  x  128,  64  x  64,  32  x  32).  In  addition,  we  have  also  displayed 
on  row  (ii)  the  corresponding  figures  using  Gaussian  fits  to  the  histograms  of  £.  There  is  virtually  no  overlap  between 


the  histograms  for  the  larger  128  x  128  pixel  window,  with  increasing  overlap  as  the  window  size  decreases.  With  a 
32  X  32  window  there  is  distinguishing  information  between  the  two  classes,  but  an  increased  overlap  in  the  histograms 
compared  to  a  128  x  128  window.  For  the  test  results  here,  we  chose  a  test  window  with  size  128  x  128.  We  also  chose 
to  consider  subdivisions  down  to  a  size  of  32  x  32  for  subsequent  classification  of  “deferred”  pixels. 


(A-ii) 


(B-ii) 


(C-ii) 


Figure  5:  Statistical  results  used  in  determination  of  threshold  values  for  each  window  size  .  Frame  (A-i)  displays 
histograms  of  values  of  £  from  128  x  128  pixel  homogeneous  regions  of  forest  (dark)  and  grass  (light).  Frame  (A-ii) 
shows  Gaussian  estimates  of  I  for  each  clutter  category  (solid  line  for  grass  and  dashed  for  forest).  Frames  (B-i)  & 
(B-ii)  and  (C-i)  &  (C-ii)  display  similar  results  for  64  x  64  and  32  x  32  pixel  regions,  respectively. 


In  practice,  the  segmentation  algorithm  will  be  confronted  with  nonhomogeneous  imagery.  When  the  test  window 
contains  a  boundary,  the  likelihood  test  is  corrupted.  For  example,  we  have  found  that  even  modest  amounts  of 
“contamination”  by  a  forested  region  in  a  predominantly  grassy  region  can  significantly  alter  the  test  statistic  1.  This 
effectively  introduces  a  bias  in  the  decision  process.  Hence,  it  is  essential  that  forest-grass  decisions  at  each  hierarchical 
level  are  made  only  if  there  is  overriding  evidence  for  one  of  the  two  hypotheses.  Because  of  this  effect,  pairs  of  threshold 
values  are  used  to  make  a  ternary  classification  decision,  as  described  in  Subsection  3.1:  If  the  likelihood  statistic,  £, 
is  above  the  “grass”  threshold,  the  pixel  is  classified  as  grass.  If  I  is  below  the  “forest”  threshold,  the  pixel  is  classified 
as  forest.  If  the  value  falls  between  the  two  thresholds,  the  classification  is  deferred.  The  values  of  the  thresholds  used 
by  the  algorithm  are  listed  in  Table  2.  Whenever  the  classification  is  deferred,  the  classification  is  determined  the 
hierarchical  refinement  procedure  defined  in  Subsection  3.2. 


Region  Size 

Grass  Threshold 

Forest  Threshold 

128 

1000 

-1600 

64 

500 

-800 

32 

50 

0 

Table  2:  Threshold  values  for  various  window  sizes.  These  values  serve  as  thresholds  in  the  ternary  hypothesis  test  for 
classification  as  grass,  forest,  or  defer  at  each  hierarchical  level  of  the  algorithm. 


4.3  Segmentation  Performance 

Figure  6  displays  the  results  of  applying  the  algorithm  to  two  images  (i  and  ii).  The  white  line  in  each  frame 
represents  a  hand-picked  estimate  of  the  boundary,  and  dark  and  light  regions  represent  clutter  classification  as  forest 
and  grass  respectively.  Each  row  shows  the  segmentation  results  for  the  SAR  image  pictured  in  frame  (A).  Frames 


(B),  (C),  and  (D)  display  (in  order):  the  results  of  classification  using  only  a  binary  test  on  the  full  128  x  128  window 
(i.e.,  without  allowing  the  possibility  of  deferred  decisions),  the  result  of  the  ternary  classification  at  the  full  window 
size  (the  mid-tone  value  identifies  those  pixels  for  which  decision  at  the  full  128  x  128  window  level  was  deferred),  and 
the  final  results  using  the  full  hierarchical  approach.  Comparing  parts  (B)  and  (D)  in  each  row,  we  see  first  that,  as 
expected,  restriction  to  using  only  the  128  x  128  window  likelihood  test  leads  to  a  bias  in  the  estimated  boundary:  the 
tree  line  is  pushed  into  the  grassy  region.  However,  when  the  full  hierarchical  system  is  used,  the  final  segmentation 
yields  more  accurate  boundary  estimation. 


(A-ii)  (B-ii)  (C-ii)  (D-ii) 


Figure  6:  Results  for  two  images  of  0.3  meter  resolution  stripmap  SAR  data.  The  white  line  in  each  frame  represents  a 
manual  estimate  of  the  grass-forest  boundary.  (A)  Original  images.  (B)  Binary  segmentation  results  (without  allowing 
deferred  decisions).  (C)  Preliminary  classifications  using  ternary  classification  scheme  (dark  grey  =  forest,  light  grey 
=  grass,  medium  grey  =  deferred  decision)  (D)  Final  results  after  refinement  procedure. 


4.4  Computational  Requirements 

The  multiscale  segmentation  algorithm  is  extremely  efficient  compared  to  most  segmentation  approaches.  The 
multiscale  algorithm  requires  computation  of  order  N,  where  N  is  the  number  of  pixels  in  the  image  to  be  segmented. 
Furthermore,  the  algorithm  is  parallelizable  because  distinct  regions  of  clutter  can  be  segmented  independently.  These 
desirable  properties  are  a  direct  result  of  the  pyramidal  structure  of  the  multiscale  algorithms.  In  particular,  the 
likelihood  test  for  a  particular  pixel  is  not  dependent  on  the  outcome  of  its  neighbors.  Thus,  iterative  cycles  to 
propagate  the  effects  of  neighboring  pixels  are  unnecessary.  This  property  makes  the  multiscale  segmentation  algorithm 
much  more  practical  than  most  competing  segmentation  algorithms. 

In  contrast,  consider  simulated  annealing  algorithms  that  assume  Markov  random  field  models.^  Under  these 
models,  the  outcome  of  the  likelihood  test  for  an  individual  pixel  is  highly  dependent  on  the  outcome  for  each  of  its 
neighbors.  Consequently,  such  algorithms  require  many  iterative  cycles  to  converge.  In  fact,  such  algorithms  require 
order  Ne^  computations  to  converge  theoretically.  (This  results  from  the  fact  that  order  iterations  are  required 


for  convergence  and  order  N  computations  are  required  for  each  iteration.)  In  practice,  such  algorithms  require  many- 
iterative  cycles  to  converge  approximately.  Consequently,  simulated  annealing  algorithms  are  impractical  to  implement. 


5  FUTURE  WORK 


The  multiscale  methodology  described  in  this  paper  is  applicable  to  other  problems  related  to  SAR  imagery.  For 
example,  the  approach  leads  directly  to  efficient  SAR  image  compression  algorithms.  Given  a  set  of  precomputed 
multiscale  models,  the  compression  algorithm  -would  determine  the  model  that  most  efficiently  represents  each  block 
of  imagery;  an  index  would  be  transmitted  for  each  block  to  specify  the  chosen  model.  The  residuals,  w(s),  could  be 
transmitted  more  efficiently  than  the  original  imagery  because  the  residuals  are  uncorrelated  by  design. 

In  addition,  the  methodology  is  applicable  to  military  target  detection.  When  a  target  is  present  in  the  test  window, 
the  clutter  models  do  not  hold;  the  residuals  and  the  likelihoods  are  likely  to  exhibit  anomalous  behavior.  If  a  test 
for  such  anomalies  is  constructed,  it  could  be  used  as  a  mechanism  for  target  detection.  Furthermore,  it  is  possible  to 
construct  equivalent  multiscale  models  for  targets;  this  could  lead  to  direct  likelihood  tests  for  target  detection. 


6  CONCLUSION 


In  this  paper,  we  have  described  a  methodology  for  the  segmentation  of  clutter  using  multiscale  models  of  SAR 
imagery.  This  methodology  exploits  the  differences  in  scale-to-scale  variability  and  predictability  of  images  of  distinct 
clutter  types.  Within  this  methodology,  likelihood  test  calculations  are  very  efficient.  The  efficiency  results  from 
hierarchical  nature  of  the  multiscale  algorithms.  This  structure  also  yields  efficient  implementation  of  a  hierarchical 
refinement  strategy  for  improving  the  accuracy  of  segmentation  near  clutter  boundaries. 
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